# Edge-side Multimodal
Xinyuan-VL-2B
Apache-2.0
Xinyuan-VL-2B is a high-performance multimodal large model for edge-side applications from Cylingo Group. It is fine-tuned from Qwen/Qwen2-VL-2B-Instruct on more than 5 million multimodal samples plus a small amount of text-only data.
Image-to-Text
Transformers, Supports Multiple Languages

Cylingo
94
7
MiniCPM-Llama3-V-2_5
MiniCPM-V 2.6 is a multimodal large model launched by OpenBMB. It surpasses GPT-4V in single-image, multi-image, and video understanding tasks and supports real-time video understanding on iPad.
Image-to-Text
Transformers, Other

openbmb
31.48k
1,394
Featured Recommended AI Models